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the redshifted 21cm line over the sky in a range of redshifts without the requirement to resolve 
individual galaxies. In this chapter, we investigate the potential of SKAl to deliver HI intensity 
maps over a broad range of frequencies and a substantial fraction of the sky. By pinning down the 
baryon acoustic oscillation and redshift space distortion features in the matter power spectrum - 
thus determining the expansion and growth history of the Universe - these surveys can provide 
powerful tests of dark energy models and modifications to General Relativity. They can also be 
used to probe physics on extremely large scales, where precise measurements of spatial curva¬ 
ture and primordial non-Gaussianity can be used to test inflation; on small scales, by measuring 
the sum of neutrino masses; and at high redshifts where non-standard evolution models can be 
probed. We discuss the impact of foregrounds as well as various instrumental and survey design 
parameters on the achievable constraints. In particular we analyse the feasibility of using the 
SKAl autocorrelations to probe the large-scale signal. 
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1. Introduction 

The cosmic microwave background (CMB) has been one of the main observational tools 
for cosmology in recent years. Although basically only giving 2-dimensional information, we 
were able to constrain the standard cosmological model with great accuracy (Planck Collaboration 
2013). This "high precision cosmology" is particularly true for the “vanilla" model with 6 parame¬ 
ters. More parameters or non-standard models can lead to degeneracies and limit the constraining 
power of the CMB (for instance the wolwa non-flat model). The next step towards precision cos¬ 
mology and exploring novel models will need to use extra information. In particular, due to its 
huge information content, measurements of the 3-dimensional large-scale structure of the Universe 
across cosmic time will be an invaluable tool. One of the most accessible methods to probe this is 
through large galaxy surveys to trace the underlying dark matter distribution. Several surveys are 
now under way or in preparation, such as BOSS (SDSS-III), DBS, eBOSS, DESI, 4MOST, LSST, 
and the Euclid satellite. These surveys are based on imaging of a large number of galaxies at op¬ 
tical or near-infrared wavelengths combined with redshift information to provide a 3-dimensional 
position of the galaxies. 

The new generation of radio telescopes now under construction will provide even larger and 
deeper radio continuum surveys, capable of detecting galaxies above redshift 3. This is particularly 
true for the SKA, as discussed in Jarvis et al. (2014). Although they can provide important con¬ 
straints on cosmological parameters, they still lack redshift information from the radio that would 
provide further improvements on the constraints, in particular for the dark energy evolution. A 
solution is to use the hydrogen 21cm line to provide the redshift information. Telescopes probing 
the sky between a rest frequency of 1420 MHz and 250 MHz would be able to detect galaxies up 
to redshift 5. The problem is that this emission line is usually quite weak: at z = 1.5, most galaxies 
with a HI mass of 10^M© will be observed with a flux density of ~ 1 /xJy using the HI line. 

In order to obtain “game changing" cosmological constraints, we showed in Santos et al. 
(2014) that experiments with sensitivities better than lOjiJy over 10 kHz channels will then be 
required to provide enough galaxies to beat shot noise and become cosmic variance dominated. 
Although “near term" radio telescopes such as ASKAP and MeerKAT should be able to achieve 
such sensitivities on deep single pointings, it will require a much more powerful telescope such 
as SKA Phase 2, to integrate down to the required sensitivity over the visible sky in a reasonable 
amount of time. This would imply that one would need to wait until then to use radio telescopes 
for cosmology. 

Galaxy surveys are threshold surveys in that they set a minimum flux above which galaxies 
can be individually detected. Instead we could consider measuring the integrated 21cm emission 
of several galaxies in one angular pixel on the sky and for a given frequency resolution. Eor a rea¬ 
sonably large 3d pixel we expect to have several HI galaxies in each pixel so that their combined 
emission will provide a larger signal. Moreover we can use statistical techniques, similar to those 
that have been applied for instance to CMB experiments, to measure quantities in the low signal 
to noise regime. By not requiring the detection of individual galaxies, the specification require¬ 
ments imposed on the telescope will be much less demanding. This is what has been commonly 
called an “intensity mapping" experiment. It is similar to what is being planned for experiments 
aimed at probing the Epoch of Reionization (at z > 6), such as the ones using the radio telescopes 
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LOFAR, MWA and PAPER. By not requiring galaxy detections, the intensity mapping technique 
transfers the problem to one of foreground cleaning: how to develop cleaning methods to remove 
everything that is not the HI signal at a given frequency. This in turn also impacts on the calibration 
requirements of the instrument. 

This chapter describes the HI intensity mapping surveys that are feasible with the SKA, list¬ 
ing the assumed telescope specifications as well as the corresponding sensitivity and the different 
cosmological constraints that can be achieved. A discussion of foreground contamination and cal¬ 
ibration requirements is also included. For a summary of the expected cosmological constraints 
with SKAl intensity mapping surveys, we refer the reader to section 7 and in particular Fig. 5. 


2. The signal 


After reionization, most neutral hydrogen will be found in dense systems inside galaxies, e.g. 
Damped Fyman-alpha Absorbers (DFAs). In terms of the brightness temperature, the average 
signal over the sky can be written as: 

T,(,-).566/.(^)(|;f){l+z)^MK. (2.1, 

where the neutral hydrogen density fraction is given by 

^Hi(z) = (1 +z)^^Phi(z)/Pc,0) (2-2) 


Phi (z) is the proper HI density and Pc,o the critical density of the Universe at redshift zero. Figure 1 
shows constraints on Dhi(z) from different experiments. At low redshifts, it is measured using 21 
cm observations directly from galaxies (except the GBT point which uses intensity mapping). At 
high redshifts, Qhi is estimated by computing the HI associated with Damped Fyman-a systems 
observed in absolution in quasar spectra. These systems are easy to identify, given their prominent 
damping wings in both high-resolution and low-resolution data even at low signal-to-noise, and a 
HI column density is inferred by Voigt profile filling. This is in lurn easily franslaled info a value 
for Dhi. Presenl conslrainfs infer a consfanf Dhi af z = 2 — 4, while al higher redshifl Ibis value is 
expecfed fo increase, as fhe Universe is becoming more neufral. For a recenl summary of observed 
frends we refer fo Padmanabhan ef al. (2014) 

Assuming fhe signal is linear wilh respecf fo fhe underlying dark matter llucfuafions, fhe fofal 
brighfness femperalure af a given posilion on fhe sky and frequency will be 


Tb{v,Q.)^Tb{z) l+bm 5 m{z) 


1 dv- 
H{z) ds. 


(2.3) 


The signal will Ihen be complefely specified once we find a prescription for fhe HI densify and bias 
function (ZiHi)- This can be oblained by making use of fhe halo mass funclion, ^ and relying on a 
model for fhe amounl of HI mass in a dark matter halo of mass M, e.g. Mhi(M), so lhal 


/'^max (jyi 

Phi(z) = / (M,z)Mhi(M,z), (2.4) 

dM 

f^max (ifi 

bm{z)=Pul dM—{M,z)Mni{M,z)b{z,M), (2.5) 
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Figure 1: Left: Current constraints on the HI density fraction as a function of redshift (Meiring et al. 2011; 
Prochaska & Wolfe 2009; Noterdaeme et al. 2009; Rao et al. 2006; Martin et al. 2010; Lah et al. 2007; 
Zwaan et al. 2005; Khandai et al. 2011), partially based on the compilation in Duffy et al. (2012). DLA 
observations are shown in blue, cross-correlations in orange, other observations in red, and simulations in 
green. The thick black line shows nHi(z) from the fiducial model power law used throughout this chapter. 
Right: Evolution of the brightness temperature times bias with redshift for the linear (red curve) and our 
fiducial power-law model (blue). 


where the halo mass function, should be in proper units and b{z,M) is the halo bias. 

Some concerns might arise due to the possible stochastic behaviour of the function Mhi(M) 
or its dependence with the environment (so that it would be a function of position also). However, 
given the low resolution pixels used in HI intensity mapping experiments, we expect a large number 
of HI galaxies per pixel, which should average-down any fluctuations and allow us to take the above 
deterministic relation for the mass function. For example, for the typical scales we are interested in 
Cosmology, one needs angular/frequency resolutions of around 1 degree and 5 MHz respectively, 
which translates into a comoving volume of ~ 10^ Mpc^. In each volume element, we expect a 
total of around 10^ dark matter halos with mass between 10^ — IO^^Mq, and ~ 31,000 with masses 
between 5 x 10^ and 1 X IO^^Mq (where the latter range corresponds to halos expected to contain 
most of the HI mass). This supports our assumption of a position-independent HI mass function 
due to the averaging over many halos. Some level of stochasticity could still increase the shot noise 
of the signal, but this is expected to be quite small. 

For the mass function, the most straightforward "ansatz" would be to assume that it is pro¬ 
portional to the halo mass - the constant of proportionality can then be fitted to the available data. 
Even in this case however, we need to take into account the fact that not all halos contain galaxies 
with HI mass. Following Bagla et al. (2010), we can assume that only halos with circular velocities 
between 30 — 200 kms^^ are able to host HI. This translates into a halo mass through 

_ / \ V3 

Vcirc = 30x/rT^ j 
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Unfortunately this option fails to fit well the HI density measurements at high-z. A more evolved 
option would be to consider the proportionality to the mass as a function of redshift. This would 
at least guarantee the fit to the density measurements by construction. Throughout this chapter, we 
decided to consider instead a simple power law model of the halo mass (M): 

Mhi(M)=AM“, (2.7) 

which is independent of redshift. We found that a value of a ~ 0.6 fits both the low z and high z 
data reasonably well. This can be seen in figure 1 (left), that shows the flHi(z) measurements and 
the evolution obtained from this model (solid line). The constant A is normalised to the results from 
Switzer et al. (2013) at z ~ 0.8. The right panel shows the redshift evolution for both the linear 
and power law model of the temperature multiplied by the bias, which is the figure of merif for fhe 
strength of the power spectrum used in the forecasts. 

Another issue is whether we can assume that the bias is scale dependent. Again, as long as 
we restrict ourselves to large scales, this should be a reasonable assumption since we are averaging 
over many galaxies. Results from simulations show that the bias can be safely assumed constant for 
k < \ h/Mpc at high redshifts (while at z < 1, it should be safe for ^<0.1 h/Mpc). Note that this 
bias can also be modelled using a variety of relatively simple prescriptions on top of the outputs of 
large volume and high resolution hydrodynamic or N-body simulations (Villaescusa-Navarro et al. 
2014). These models capture the essential features of the HI distribution and aim at mimicking 
more complex physical effects (e.g. radiative transfer) by assigning HI to each (dark matter or 
gas) paiticle of the simulated volume. Although less sophisticated than the framework presented 
in Obreschkow et al. (2009), these models work remarkably well, allowing us to quantitatively 
address the scale dependence of the bias. They show that at z > 2 the HI bias steepens for scales 
k> \ Mpc^^ in a way which depends on the actual HI modelling as can be seen in Fig. 2. A 
summary of how uncertainties in the bias signal propagate into the 21 cm HI power spectrum has 
been recently provided by Padmanabhan et al. (2014). 

3. Simulations 

There exists a number of practical challenges to intensity mapping, for instance the problem 
of foregrounds (see section 4 below), that must be addressed in order to maximize the amount and 
quality of the scientific information that can be extracted. Quantifying the exact statistical and sys¬ 
tematic uncertainties for a practical experiment analytically can be an unsurmountable task, and as 
is now the trend in most cosmological observations, simulations must be used, which must describe 
both the cosmological signal we expect to measure and all other processes (e.g. foregrounds, in¬ 
strumental effects, etc.) that may have a significant effect on the recovered data. Ideally we would 
want these simulations to yield the most realistic state-of-the-art description possible, however this 
is not always feasible if a large number of independent realizations need to be generated in order 
to quantify the aforementioned uncertainties. A compromise between computational speed and 
complexity must be met, so that enough simulations can be run, while correctly reproducing the 
relevant physics. 

In the case of the cosmological signal for intensity mapping it is possible to accomplish this by 
using simplified mefhods fo generafe realizafions of fhe maffed densify field fhaf follow fhe correcf 
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Figure 2: HI simulations from Villaescusa-Navarro et al. (2014). Left: HI bias as a function of the wave- 
number at z = 2.4 for three different ways of modelling the HI within and outside haloes. Halo-based model 
1 refers to a modihcation of the modelling performed by Bagla et al. (2010); halo-based model 2 is built in 
order to fit the recent BOSS constraints on the Damped Lyman-a systems; particle-based model is inspired 
by the work of Dave et al. (2013) which assigns HI both within and outside haloes and hts constraints of 
intergalactic medium data. Note the different scale dependence of the three models and the large bias of the 
halo-based model 2. Right: for the same three models we show the evolution of the HI bias with redshift in 
the range probed by the simulations. 

distribution on large scales. Along these lines, a lot of work has been done in the last years within 
the community of galaxy redshift surveys on producing fast but accurate methods to generate mock 
simulations of the galaxy distribution (Tassev et al. 2013; White et al. 2014). One of the goals of 
such models is to populate dark matter haloes with a realistic distribution of neutral hydrogen. To 
do this, N-body or hydrodynamic simulations need to be performed at relatively high resolution to 
properly resolve the smallest haloes that can host HI. This framework has the following features: 
i) it is simple - we refer to Villaescusa-Navarro et al. (2014) for a comprehensive review in which 
three different methods based on Bagla et al. (2010) (halo based) and on Dave et al. (2013) (particle 
based); ii) it can be easily complemented by a Halo Occupation Distribution model in order to 
cross-correlate HI properties with those of a realistic galaxy population; Hi) it allows us to address 
quantitatively the bias scale dependence (as we discussed above) and the amount of HI which could 
reside outside haloes; iv) it is bound to reproduce basic observable quantities of the HI distribution 
at high and low redshift (like for example Lyman-a forest absorption lines); v) it can be translated 
promptly into realistic observed HI maps, once the noise/instrumental properties are known. One 
recent potentially interesting finding by Font-Ribera et al. (2012) with the SDSS-BOSS survey, 
and using the properties of Damped Lyman-a systems, have demonstrated that HI is hosted in 
relatively high mass haloes. If we require the simulations to reproduce this, then the HI bias will 
be larger by a factor of two compared to models for which we disregard these observations (this is 
the halo-based model 2 of Fig. 2). 

The case of the foregrounds is, however, more complicated. While it is possible to use a 
few datasets and certain empirical models to produce conservative realizations of the radio fore¬ 
grounds (de Oliveira-Costa et al. 2008; Shaw et al. 2013; Alonso et al. 2014b), the lack of full-sky 
multi-frequency data prevents us from developing truly realistic simulations of the radio sky. This 
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Figure 3: Left; Frequency-dependence of the different foregrounds and the cosmological signal at different 
galactic latitudes (given in the top right comer of each panel), according to the simulations of Alonso et al. 
(2014b). A 1 % polarization leakage was assumed for this simulation. Right: Angular power spectrum of the 
foreground-cleaned temperature map at 600 MHz for different numbers of foreground degrees of freedom 
(A^fg). using a PCA approach. The method converges to the tme cosmological signal for Afg = 6. 

situation will improve in the future as better quality data is obtained by the new radio observatories, 
which will occur at the same time as the first intensity mapping observations. Alonso et al. (2014b) 
have recently created a publicly available code to generate fast IM mock observations, including 
the cosmological signal, foregrounds and some simple instrumental effects. This code can be used, 
for example, to study the influence of foreground subtraction on the recovered cosmological signal, 
or to analyse the effects of different instrumental configurations (see figure 3). 

4. Foregrounds 

One of the most important challenges facing HI intensity mapping is the presence of fore¬ 
grounds (both galactic and extra-galactic) with amplitudes several orders of magnitude larger than 
the signal to be measured. Because the frequency structure as well as other statistical properties 
of the foregrounds are significantly different from those of the cosmological signal, it is not un¬ 
reasonable to hope that they can be successfully subtracted (Di Matteo et al. 2002; Oh & Mack 
2003; Santos et al. 2005; Morales et al. 2006; Wang et al. 2006; Gleser et al. 2008; Jelic et al. 2008; 
Liu et al. 2009; Bernardi et al. 2009, 2010; Jelic et al. 2010; Moore et al. 2013; Wolz et al. 2014a; 
Shaw et al. 2013). Although a lot of work in terms of simulations and testing cleaning techniques 
has already been done, we still face huge challenges ahead, in particular if we want to use this 
signal for high precision cosmology. Increasingly realistic large simulations should be developed 
to try to test the limitations of the intensity mapping measurements. This should include as many 
instrumental effects as possible in order to account for possible contamination from the calibration 
process. Ultimately, we will need to start analysing real data in order to improve and build up our 
knowledge towards the SKA. 
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4.1 Foreground classification 

Foregrounds for intensity mapping ean essentially be elassified as being extragalaetie (eaused 
by astrophysieal sourees beyond the Milky Way) or galaetic in nature. 

The most relevant galaetie foregrounds are galaetic synchrotron emission (GSE) and galactic 
free-free emission. GSE is caused by high-energy cosmic-ray electrons accelerated by the Galactic 
magnetic field, and is by far the dominant foreground for HI intensity mapping, being up to five 
orders of magnifude larger fhan fhe cosmological signal. Eor fhe relevanf radio frequencies if 
should be possible fo describe GSE as a power-law in frequency wifh a specfral index j3 ~ —2.8 
(Dickinson el al. 2009; Delabrouille el al. 2013). Eurlhermore, GSE is parlially linearly polarised. 
Ils polarised pari will be affected by Ihe Earaday effecl, a rolalion of Ihe polarizalion angle caused 
by Ihe Galactic magnetic field and Ihe optically Ihick interstellar medium (Rybicki & Eighlman 
1986; Waelkens el al. 2009). Since Ihe dependence of Ibis effecl on frequency is quite slrong 
for Ihe frequency range lhal perlains lo intensify mapping, any leakage from Ihe polarized pari 
into Ihe unpolarized measuremenls of Ihe cosmological signal would generate a very Iroublesome 
foreground to sublracl (Jelic el al. 2010; Moore el al. 2013). On Ihe olher hand, free-free emission 
is caused by free elections accelerated by ions, and Ihus Iraces Ihe warm ionised medium. As in 
Ihe case of GSE, free-free emission is predicted to be speclrally smoolh in Ihe relevanl range of 
frequencies (Dickinson el al. 2003). 

Exlragalaclic radio sources can be classified into Iwo differenl categories: brighl radio galax¬ 
ies, such as active galactic nuclei, and “normal” slar-forming galaxies. The spatial distiibulions of 
Ihese Iwo lypes should be qualilalively differenl, Ihe former being less dominated by gravilalional 
clustering and more by Poisson noise. This has an impacl on Ihe degree of smoolhness across fre¬ 
quency of Ihe observed intensify when considering Ihe combined contiibulion of all Ihe galaxies in 
a given direction of Ihe sky (Santos el al. 2005; Zaldarriaga el al. 2004). 

Olher possible foreground sources are almospheric noise, radio frequency interference and line 
foregrounds, caused by line emission from aslrophysical sources in olher frequencies. Due to Ihe 
specfral isolation of Ihe 21cm line, logelher wilh Ihe expected low intensify of Ihe mosl potentially 
harmful lines (such as OH al Vqh ~ 1600MHz), Ihe HI signal should be very robusl againsl line 
confusion. 

4.2 Foreground subtraction 

The problem of foregrounds has been addressed in the literature mainly within the EoR regime. 
The different algorithms that have been proposed to date can be classified into blind (Wang el al. 
2006; Switzer el al. 2013; Chapman el al. 2012) and non-blind (Eiu & Tegmark 2011; Shaw el al. 
2013, 2014) methods, depending on the kind of assumptions made about the nature of the fore¬ 
grounds (e.g. whether only generic properties such as spectral smoothness and degree of correla¬ 
tion are assumed or whether a more intimate knowledge of the foreground statistics is required). 
The poor observational constraints on the foregrounds in the relevant range of frequencies justi¬ 
fies considering the use of blind methods. Recently Wolz et al. (2014a) studied the effectiveness 
of independent component analysis (in particular the implementation of EastlCA, e.g. Hyvarinen 
1999) for intensity mapping. By propagating the foreground removal residuals into the cosmo¬ 
logical analysis, they showed that, while foreground cleaning may induce a residual bias on large 
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angular scales, which could prevent a full analysis based on the shape of the temperature power 
spectrum, robust features like the BAO scale should remain unaffected. This result is reasonable: 
most relevant foregrounds are (fortunately) exceptionally smooth and therefore it should be pos¬ 
sible to distinguish them from the much “noisier” cosmological signal. Any foreground residual 
will probably be dominated by galactic synchrotron emmission, which is most relevant on large 
angular scales. In the same context, Alonso et al. (2014a) studied the efficiency of different blind 
cleaning methods (see figure 3), showing fhaf foreground removal can be successful over a wide 
range of scales provided fhe foregrounds are sufficienfly smooth, and that all blind methods yield 
quantitatively similar results. 

In the realm of non-blind methods, the parametric eigenvalue algorithm developed by Shaw 
et al. (2013) for the CHIME experiment decomposes the data with help of statistical models for 
both foregrounds and 21cm signal. This algorithm leaves minor foreground residuals in the large 
modes of the power spectrum. Instrumental errors such as polarization leakage, beam deforma¬ 
tions and calibration uncertainties can significantly affect the foreground removal by mode mixing 
effects. Shaw et al. (2014) advanced their description to polarized data considering a number of 
instrumental errors in their tests. For future SKA experiments, detailed studies including varying 
instrumental settings and the impact of the residuals on the power spectrum are required in order 
to minimize bias on cosmological results. Foreground subtraction for the SKA is treated in detail 
in Wolz et al. (2014b) including realistic simulations. 

4.3 Polarisation leakage 

Although the cosmological signal is unpolarized, sky polarization can represent an additional 
foreground source due to imperfect calibration. The problem can be described by using the mea¬ 
surement equation formalism (Hamaker et al. 1996; Smirnov 2011) that describes the propagation 
of the signal through an interferometric array. A pedagogical view can be presented by using the 
scalar form of the measurement equation (Sault et al. 1996), which relates the measured visibilities 
to the (7, Q,U,V) Stokes parameters that describe the true sky brightness distribution: 

V^T = \sf8‘;-{i + Q) 

= \gU)iu+iv) 
y?f = {glg^u-iv) 

Vu = \ghp-Q)- ( 4 - 1 ) 

Here, V represents the visibility between antennas {i,j) and polarisations {p,q), e.g. it represents 
the cross-correlation of the electric fields measured by each anfenna (fhe oufpuf of fhe correlator). 
The case of "single dish" observations can be simply represenfed by pulling i = j. The aclual 
response of fhe lelescope fo fhe inpuf sky is represented here by g, usually referred as fhe "gain" of 
fhe syslem. More inslrumenlal effecls can be included in Ihese equalions bul fhe simple approach 
above is enough lo show fhe effecl. 

If we can calibrale the instrument perfectly, then we can effectively renormalise the gains 
above (i.e. set g^ = g‘^ = 1) and obtain the measured intensity as lij = = I (again this is 
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also valid for i = j). However, in the presence of calibration errors, there will be an uncertainty in 
these gains, e.g. = 1 + This will effectively translate into an error in the estimated total 
intensity: 4,; = I + dl + dQ where dl = ^ {dgf + dg^j + dg^ + r/gp is the usual assumed error due 
to inaccurate calibration and dQ= ^(dgf + dg^j — dg‘^j — dg‘j) is the polarization term that leaks to 
the total intensity (in the expressions above we assumed small calibration errors). Concentrating in 
the auto-correlations we have dl = l{dgP + dg‘^) and dQ = Q{dg^^ — dg‘1). We then see that even if 
there is the usual calibration error, the leakage will be zero as long as the error is the same for both 
polarizations. 


Although the polarization leakage is different for different instruments, typical values for leak¬ 
ages are below 1% and they tend to be reasonably stable over time scales of hours. In this case, the 
greatest contamination could come from off-axis leakage, i.e. signals entering the telescope from 
directions other than the pointing one. Their magnitude can be far greater (i.e. up to 30%), de¬ 
pending upon the observing frequency and their time variability. All the algorithms for foreground 
subtraction rely on the frequency smoothness of the total intensity spectrum. This smoothness is 
theoretically well motivated (Santos et al. 2005; Petrovic & Oh 2011; Bernardi et al. 2014) but no 
longer holds for polarization as the Stokes Q, U parameters are Faraday rotated when the radiation 
goes through an ionized medium. The emerging scenario is that intrinsically smooth synchrotron 
radiation can be contaminated by non-smooth polarized emission due to imperfect calibration. This 
additional foreground seems to be the limiting factor of current HI mapping measurements (Switzer 
et al. 2013). Note however that we can in principle model this contamination from the leakage of 
faraday rotated polarisation by "looking" at polarised point sources with the telescope. 


Unlike the EoR case, both point-like and diffuse Galactic polarized emission may be prob¬ 
lematic for intensity mapping at z ~ 1 — 2. The average polarization fraction of extragalactic radio 
sources is ~5% at 1.4 GHz (Tucci et al. 2004)) with RM values up to a few tens of rad m^^ at 
high Galactic latitude where HI intensity mapping is carried out (Simard-Normandin et al. 1981; 
Taylor et al. 2009). The properties of Galactic synchrotron polarization are much less known at the 
frequencies relevant to HI intensity mapping. It is fairly observationally established that the spatial 
distribution of polarized intensity poorly correlates with total intensity at 1.4 GHz due to small scale 
structure present in the ionized interstellar medium (i.e. Gaensler et al. 2001; Bernardi et al. 2003; 
Sun et al. 2011). Observations of supernova remnants also show that objects further away than a 
few kpc are completely depolarized at 1.4 GHz, indicating the presence of a polarization horizon 
beyond which diffuse polarization is no longer observable (Sun et al. 2011). The distance of such 
polarization horizon decreases at lower frequencies, down to a few hundreds pc at 150-300 MHz 
(Haverkorn et al. 2004; Bernardi et al. 2013), indicating that relativistic and thermal plasma are 
co-located in the interstellar medium (Burn 1966). Typical RM values for Galactic polarization 
also decrease with decreasing frequencies. Given the complex spatial and frequency properties of 
Galactic polarization, extrapolations to the frequencies relevant for HI intensity mapping observa¬ 
tions are fairly uncertain, although we expect that a significant improvement will happen in the 
next years due to new surveys. 
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5. Experimental considerations 
5.1 Noise 

Most cosmological applications with HI intensity mapping will rely on the use of statistical 
quantities, in particular the power spectrum or its equivalent in real space - the 2-point correlation 
function. To that effect, the noise power spectrum is a prime quantity to access the sensitivity of a 
given experiment (and associated survey) to detect the cosmological signal. Two main setups can 
be considered: surveys using single dish observations where the auto-correlation signal from one 
or more dishes is used, or surveys using interferometers where the cross-correlation signal from the 
array elements is used. 


5.1,1 Auto-correlations 

For single dish observations, the noise temperature rms is given by 


(Jt 


AsAQ.y^25vtp 


sys 


ly/ldVtp 


(5.1) 


where AQ. is the beam area of the telescope, Ag its effective collecting area, 8v the frequency 
resolution, tp the observing time per pointing, A the wavelength of the observation and the 
total system temperature. The factor of 1 / s/l takes into account the fact we have 2 polarisations. 
The second approximation takes AQ. to be the square of the FWHM of the beam and uses a fudge 
factor e Ri 1 to factor in the efficiency of the telescope. Pointings need to be packed so that the 
measurement is reasonably continuous across the sky. Usually this means pixels of size 0g ps 
(7i:/ 8)(1.3A/D)^ [sr] or smaller. Given a total observing time ttot> the time per pointing is then 
tp = hot /Sarea where S'area is the survey area. The 3D noise power spectrum is just Pn = Oj Vpix, 

where Upix = (rPe)^ x (y^v) is the 3D comoving volume of each volume element, r is the comoving 
distance to the redshift of the signal andy = +zf'/V 2 i- Therefore, the cancels in the 

power spectrum and we finally get 


Pn = t^y 


T2 c 
^ sys'^^ea 

2£^ttot 


(5.2) 


As we can see, the dish size drops out of the final expression - it is only relevant for the angular 
resolution which should match what is required for the signal we are trying to measure (to sum¬ 
marise, the dish collecting area and beam size are connected and will cancel out and the way we 
"pack" the beams for mosaicking is connected to the assumed pixel resolution in the map and will 
cancel out in the power spectrum). For a fixed total observation time, the survey area should also 
be chosen to match the required angular scales. If the noise power spectrum is similar to the signal, 
one also gains by increasing the survey area since this will increase the number of independent 
measurements for a given scale. 

If we have Nd dishes, the combined noise power spectrum will be Pn = r^yP^s^aiea/i'^^^Ndttot)- 
For a single dish with Nb beams, we can cover the same sky area in less time, so that the noise power 
spectrum will go as Pn = r^yP^s^aiea/{'^^^NdNbkot)- With Phased Array Feeds (PAFs) the situation 
is slightly more complicated as the feeds are packed in order to allow for a large number of beams. 
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This will imply some amount of beam overlap below a given eritieal frequeney 

-2 c I 1 V > Vcrit 

,2 


Pn = r y 


T2 c 


le^NdNbho 


X 


(5.3) 


V< Vcrit 

where VcUt is the PAF eritieal frequeney. Note that S'area > (with oc 1/v^ as usual). 

5.1.2 Interferometer 

For observations with interferometers, we start by eonsidering the noise rms in the uv plane 
for a "uv" pixel of size (An)^: 


ar(u,v)= 


^ sys 


(5.4) 


Aeyi5vn{u) {Au^tp ’ 

where Ag is the effeetive eolleeting area of one element (dishes or stations), tp the time per pointing 
and n(u) is the average number density of baselines (averaged over a 24h period), usually only a 
funetion of |u|. 

For interferometers, we are going to assume that mosaieking different pointings will not allow 
the reeovery of angular scales larger than the telescope field of view (the primary beam), which is 
basically set by the size of the array elements, e.g. ~ [^r], where D is the dish or station 

diameter. Usually, the total observing time ftot and time per pointing tp are then the same. On the 
other hand, we can use different pointings to increase the number of independent measurements on 
scales smaller than the telescope field of view. The fime per pointing should fhen be decreased as 
tp = hoi/Np, where Np is fhe number of poinfings. The 3d noise power specfrum is fhen given by 


AV(k, v) = 




(5.5) 


wifh u = r{z)kj^/ (III) and Np = S-, 
plane befween some minimum D, 


2A^n{u)t tot 2A^gdi{v)n{u)t tot 

;a/6g. Nofe fhaf if we assume fhaf n{u) is consfanf on fhe uv 
nun and maximum D^ax baseline, fhen we can write 

NaiNa-l)A^ 


n{u) = 


(5.6) 


2n{Dl^-DiJ 

where Na is fhe number of elemenfs of fhe inlerferomefer confribufing fo fhaf baseline range. The 
expression follows by nofing fhaf fhe infegrafion over n{u) should give fhe fofal number of base¬ 
lines. In fhe analysis below, we considered fhe full n{u) disfribufion^ 

For PAFs, each of fhe beams will be cross-correlafed wifh fhe corresponding beam from a 
differenl dish (e.g., fhe field of view of fhe inlerferomefer is sef by fhe size of one of Ihese beams). 
The fofal number of beams per dish, Nb, will allow fo survey a largel area more quickly and fhus 
increase tp = UotNbQ^/Sw&a- However, we need fo consider again fhe beam overlap below fhe 
critical frequency so fhaf 

2 


Pn = 




2AlNbn{u) [08 (Vcrit)] hot 


X < 


( — ) 

V Vcrit J 


V > Vcrit 


V < Vcrit 


(5.7) 


^ files with n(u) for the different telescope setups are available at https://gitorious.org/radio-fisher/bao21cm.git 
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Note that the eolleeting area of one element of a dish array, Ag ean be written as Ag ps nD^e/A with 
£< 1 - 

If we eonsider aperture arrays sueh as what is used for SKA-LOW, the situation is slightly 
different. In that ease, above a given eritieal frequeney, the area of a station will go as 1/v^, being 
eonstant below that (when the array beeomes dense). On the other hand, the array beam 6 b should 
go as 1 / V at any frequeney (set by the size of the array) so that 


Pn = 




2 [Ae(Vcrit)]^ Bl{v)Nbn{u)ttoi 


X < 


f—V 

V Writ J 

1 


V > Vcrit 

V < Vcrit 


(5.8) 


where we are already eonsidering the possibility of multiple beams {Nb) with SKA-LOW. 


5.1.3 Total error 

Computation of the total error in the measurement of the power speetrum at a given seale, P{k), 
will require knowledge of both the noise and the number of independent modes used to measure 
that seale, sinee the error is ~ {P{k) +PN{k)) /\/Amodes- These number of modes will be related 
to the volume of the survey and optimisation will depend on the balanee between redueing noise 
and inereasing the number of modes. Moreover, issues sueh as the large k eutoff along the angular 
and frequeney direetions need to be faetored in due to the resolution of the experiment. All these 
details have been taken into aeeount in the foreeasting and fully deseribed in Bull et al. (2014a). 

5.2 Target scales 

The seales we want to probe for eosmology will impose requirements on the teleseope spee- 
ifieations (or the other way around). In terms of probing baryon aeoustie oseillations (BAO - see 
Bull et al. 2014b), the relevant seales ean be translated to: 

• Angular seales between 30 areminutes and 4 degrees. 

• Frequeney seales between 2 MHz and 35 MHz 

• Surveys eovering large areas of the sky are required in order to inerease the statistieal detee- 
tion. 

• A large bandwidth is required in order to maximize the redshift range eovered. Ideally: > 350 
MHz (0 < z < 3). Though a split in bands is probably required: the low frequeney part will 
be important for large seale physies and "non-standard" eosmologieal eonstraints while the 
high frequeney/low z part will be useful to probe the more standard "vanilla" models. Sinee 
there will be other surveys probing this low z region we suggest to foeus on the < 1000 MHz 
region if a split in bands needs to be made. 

For GR eorreetions and non-Gaussianity, we are interested on ultra-large seales, e.g. modes k < 
0.01 Mpe^^ This should allow high preeision measurements to be made of the turn over seale in 
the power speetrum. These seales will eorrespond to angular sizes > 10 deg (at z ~ 1) and frequeney 
intervals of order 100 MHz whieh should be easily aehievable (but note that the foreground eleaning 
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Redshift 

Erequency 

(MHz) 

Average Signal 
(AtK) 

Angular scale 
(deg) 

Signal rms 
(^K - 1 MHz) 

Signal rms 
(itK - 10 MHz) 

0.5 

947 

156.5 

1.4 

36.2 

29.0 

1.0 

710 

254.8 

1.9 

29.8 

24.7 

1.5 

568 

351.0 

2.4 

25.9 

21.5 

2.0 

473 

427.8 

2.8 

23.2 

19.1 


Table 1: Average of the signal as a function of redshift/frequency as well as the signal rms (square root of the 
variance) for scales relevant for BAG 150 Mpc) assuming single dish observations with ^ 15 m dishes. 
The variance of the signal is calculated for a 3d smoothing window corresponding to the given angular and 
frequency scale (frequency in parentheses). 

scale will require even larger bandwidths). These angular requirements mean that we will not be 
able to use interferometers to probe these large scales since their primary beam is usually smaller 
than this (and we are assuming that mosaicking cannot be used to recover the large scales with the 
interferometer). 

The signal we are looking for fluctuates both in frequency and across the sky. We are not 
looking for the average intensity. In table 1 we show the expected signal rms fluctuations for scales 
related to the dish FoV and a couple of frequency intervals. 

5.3 Current and planned experiments 

First attempts at using intensity mapping have been promising, but have highlighted the chal¬ 
lenge of calibration and foreground subtraction. The Effelsberg-Bonn survey (Keip et al. 2011) has 
produced a data cube covering redshifts out to z = 0.07, while the Green Bank Telescope (GBT) has 
produced the first (tentative) detection of the cosmological signal thr'ough IM by cross-correlating 
with the WiggleZ redshift survey (Chang et al. 2010; Switzer et al. 2013; Masui et al. 2013). As 
probes to constrain cosmological parameters these measurements are, as yet, ineffective, but they 
do point the way to a promising future. 

As described above, we can divide the intensity mapping experiments into two types: single 
dish surveys and interferometers. In single dish surveys (e.g. using auto-correlations) each pointing 
of the telescope gives us one single pixel on the sky (though more dishes or feeds can be used to 
increase the field of view). This has the advantage of giving us the large scale modes by scanning 
the sky. Since brightness temperature is independent of dish size we can achieve the same sensi¬ 
tivity with a smaller dish although that will in turn limit the angular resolution of the experiment 
(a 30 arc min resolution at z ~ 1 would require a dish of about 50 m in diameter). One example 
is the GBT telescope as described above. BINGO (Battye et al. 2013) is a proposed 40m multi¬ 
receiver single-dish telescope to be situated in South America and aimed at detecting the HI signal 
atz ~ 0.3. 

Interferometers basically measure the Fourier transform modes of the sky. They have the 
advantage of easily providing high angular resolution as well being less sensitive to systematics 
that can plague the auto-correlation power. On the other hand, the minimum angular scale they can 
probe is set by their shortest baseline which can be a problem when probing the BAO scales. One 
example of a purpose built interferometer for intensity mapping is CHIME, a proposed array, aimed 
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at detecteding BAO at z ~ 1, made up of 20 x 100m cylinders, based in British Columbia, Canada. 
The pathfinder has 2 half-length cylinders, and the full experiment has 5 (CHIME Collaboration 
2012 ). 

The next generation of large dish arrays can also potentially be exploited for HI intensity 
mapping measurements. Such is the case of MeerKAT and ASKAR However, these interferometers 
do not provide enough baselines on the scales of interest (5m to 80m) so that their sensitivity to 
BAO will be small. The option is to use instead the auto-correlation information from each dish, 
e.g. make a survey using the array in single dish mode. The large number of dishes available with 
these telescopes will guarantee a large survey speed for probing the HI signal. The great example 
of this approach will be SKAl, the first phase of the SKA telescope, to be built in 2018. A HI 
intensity mapping survey will turn SKA phase 1 into a state of the art cosmological probe and we 
discuss its use in the next sections. 

6. Surveys with the SKA 

In terms of HI intensity mapping surveys with SKA, several factors need to be considered: 

• SKA phase 1 (SKAl) is planned to have 3 different instruments and all of them can in 
principle provide an intensity mapping survey at redshifts below 5 (after reionization). This 
includes SKAl-LOW which is planned to operate at frequencies below 350 MHz (so z >3). 
SKAl-MI D and SKAl-SUR are on the other hand planned to work down to 350 MHz, 
although the deployment of the required bands might not happen at the same time. 

• Both SKAl -MI D and SUR can be used in "single dish mode" where the auto-correlations are 
used to probe the large cosmological scales, or in "interferometer mode" better at resolving 
the smaller scales. For SKA-LOW we will only consider the interferometer mode. In prin¬ 
ciple one can also consider a survey with SKA-LOW using the auto-correlation of the beam 
from each station (from the beam-former). This could be useful for probing large scales with 
a full sky survey but wouldn’t be optimised for BAO since at ~ 350 MHz the beam would 
be around 3 deg^. Other option would be to consider the dipoles as the correlation elements 
instead of stations. These are possibilities that need to be further explored, but since SKA- 
LOW is more focused on high redshifts where reionization effects need to be factored in, for 
the current analysis we decided to concentrate only on the standard interferometer case as an 
example for SKA-LOW. 

• Both SKAI-MID and SUR will have several bands and, according to the current design, we 
will need to use two bands to cover the full redshift range from z = 0 to z = 3. 

• The SKA is probably going to be built in 3 phases (or even 4 if we consider the SKA precur¬ 
sors, MeerKAT and ASKAR). We should therefore consider in the analysis two more phases 
besides SKAl: SKA Rhase 0 - An "early science" phase of deployment for each SKAl com¬ 
ponent (SKAl-LOW, SKAl-SUR, SKAl-MlD), where sensitivity has grown to about 50% 
of its fully specified level and the full SKA (SKA Rhase 2), with lOx the sensitivity and 20x 
the field of view of SKAl. 
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Telescope 

Band [MHz] 

Z 

Target freq. 

Tinst[K] 

Nd 

D [m] 

Aeff[m2]U) 

0| [deg^j 

Nb 

SKAO-MID 

900 - 1760 

(0) - 0.58 

1310 [MHz] 

20 

127 

15 

140 

0.51 

1 

SKAO-SUR (4) 

650- 1800 

(0)- 1.19 

1300 

30 

48 

15 

140 

0.51 

36 

SKAl-MID 

350 - 1050 

0.35 - 3.06 

700 

28 

254 

15 

140 

1.78 

1 

SKAl-MID 

900 - 1760 

(0) - 0.58 

1310 

20 

254 

15 

140 

0.51 

1 

SKAl-SUR 

350 - 900 

0.58 - 3.06 

710 

50 

60 

15 

140 

1.71 

36 

SKAl-SUR (4) 

650- 1800 

(0)- 1.19 

1300 

30 

96 

15 

140 

0.51 

36 

SKAl-LOW 

50 - 350 

3.06 - 27 

110 

40 

911 

35 

925 (f) 

28 

3 

SKA2 (9 

300 - 1000 

0.42 - 3.73 

500 

15 

4000 

10 

63 

8.0 

10 


Table 2: Telescope/survey configurations. For frequency dependent quantities, the values are calculated 
at the indicated target frequency. In order to compare between different setups, a total of 10,000 hours is 
assumed for each survey. Both single dish and interferometer data are considered for each survey when 
possible. Survey area is chosen to optimise detection at the required cosmological scale. For all surveys we 
take the total temperature to be Tsys = + Tsky with = 0.1 + Tinst and « 60 (300MHz/ 

K. 

Notes: (a) MID and SUR telescopes are already assumed to include MeerKAT (64 dishes) and ASKAP (36 
dishes) respectively, (b) For MID and SUR the largest band of the combined telescopes is indicated assuming 
that outside the overlapping band only the corresponding dishes are used in the sensitivity calculations, (c) 
Diameter of dish or station, (d) Effective collecting area of the dish or stations, (e) Primary beam or 
instantaneous field of view of the telescope at the target frequency - assumed to go as 1 /unless stated 
otherwise. For the combined telescopes, the smallest beam of the two telescopes is used, (f) Assuming that 
all ASKAP PAFs will be replaced to meet the SKA 1-SUR band and instrument temperature of 30K. (g) Only 
500 MHz instantaneous bandwidth, (b) The combined PAF beam (Nt x 0j) is assumed constant below the 
target (critical) frequency as explained in the text, (i) Note that bandx Nt is fixed at 300 MHz. (j) Assumed 
to be constant below the target frequency and going as 1 /above it. (k) Beams can point in different 
directions so no overlap is assumed. (1) Values here are completely indicative. We choose to consider an 
interferometer type experiment, targeting the high (z > 1) redshift region and a FoV large enough for BAG 
scales. We only fix the sensitivity to be about 10 times what is assumed for SKAl and the total field of view 
(including number of beams) to be about 20 times SKAl at the target frequency. 


The baseline design for SKAl is described in Dewdney et al. (2013) and further updated in 
Braun (2014). To summarise, SKAl-MID will comprise of 254 single pixel feed dishes (including 
MeerKAT 64 dishes), to be built in South Africa, and 96 dishes (including ASKAP 36 dishes) fitted 
with 36 beam PAFs to increase the field of view fo be sef in Ausfralia. SKAl-LOW consisfs of 911 
aperfure array sfafions each wifh 35 m diamefer. Following the details in table 1 of Santos et al. 
(2014) we list in table 2 the different surveys that are considered in this chapter. 

7. Cosmological constraints 

7.1 Dark energy and spatial curvature 

Surveys of large-scale structure are a rich source of information about the geometry and ex¬ 
pansion history of the Universe. The baryon acoustic oscillations (BAO) are a preferred clustering 
scale imprinted in the galaxy distribution, originating from the time when photons and baryonic 
matter were coupled together in the early Universe. By using them as a statistical ‘standard ruler’, 
one can obtain constraints on the expansion rate, H{z), and (angular) distance-redshift relation. 
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Figure 4; Survey volumes and redshift range for various current and future surveys (volume calculated at 
the central redshift). 

Da{z), as functions of redshift, as has been done successfully with recent large galaxy redshift 
surveys such as BOSS and WiggleZ. Measuring these functions is vital for testing theories of dark 
energy which seek to explain the apparent acceleration of the cosmic expansion, as they constrain 
its equation of state, w = P/p, and thus its physical properties. Shedding light on the behaviour of 
dark energy - especially whether w deviates from — 1 and whether it varies in time - is one of the 
foremost problems in cosmology. 

To precisely measure the BAO feature in the matter correlation function, which appears as a 
‘bump’ at comoving separations of r ss 100 /j^* Mpc, one needs to detect many galaxies (in order 
to reduce shot noise), and to cover as large a survey volume as possible (in order to reduce sample 
variance). Intensity mapping has a few major advantages over conventional galaxy surveys for 
this task. IM surveys can map a substantial fraction of the sky with low angular resolution in a 
short period of time. Combined with the wide bandwidths of modern radio receivers, this makes 
it possible to cover extremely large survey volumes and redshift ranges in a relatively short time, 
helping to beat down sample variance (see Fig. 4). 

While individual galaxies cannot in general be resolved, each telescope pointing measures the 
integrated emission from many galaxies, making the total signal easier to detect and reducing the 
shot noise. All that is required is to obtain sufficient flux sensitivity to detect the integrated 21cm 
emission and to have sufficient resolution to resolve the required scales at a given redshift. Figure 5 
summarises the expected constraints from the SKA HI IM surveys for two relevant target scales: the 
BAO scale at k ~ 0.074 Mpc^' and a very large scale, past the equality peak at k ~ 0.01 Mpc^^ We 
see the huge constraining power of these surveys. In particular, due to the large volumes probed, 
they will be unmatched on ultra-large scales. Even at BAO scales, both SKAl-MID and SUR 
present constraints not far from Euclid while only using a ~ 2 year survey (the full Euclid requires 
about 5 years). Moreover, SKAl-EOW will be able to make a detection at z ~ 4 which again will 
be an unique feature. 

Eor the BAO scales (see Table 1 and Eig. 5, left panel), the angular resolution of the Phase 1 
SKA dishes is such that these scales are best matched to an autocorrelation survey at low redshift. 
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Figure 5: Left: Constraints (noise over signal) from SKA HI IM surveys for BAG scales (k ~ 0.074 Mpc^ *) 
as a function of redshift. Dashed line shows the BAG detection threshold. Assumptions: 10,000 hours 
observation, 25,000 deg^ survey and bins of dz = 0.1, except for SKAl-MID in interferometer mode and 
SKA-LGW where 1,000 deg^ and dz = 0.3 was taken. The results for SKAO band 2 (low z), where only 
50% of the dishes are used, was not shown as the results are very similar to SKAl. The lower green curve 
shows what would be expected from a SKA2 IM survey (in interferometer mode) optimised for high-z. The 
grey curve shows what can be expected for a two-year Ha galaxy survey with similar depth as Euclid but 
over a smaller sky area. Right: Constraints (noise over signal) from SKA HI IM surveys for large scales, 
past the equality peak (k ^ 0.01 Mpc^^) as a function of redshift. A value below 1 would imply a detection. 
For SKAl-SUR band 2, the available 500 MHz bandwidth was chosen at the low end of the band in order to 
probe higher redshifts. SKAl-MID band 2 is not shown as it is constrained to low redshifts (z < 0.5) with 
the current band specs. Dashed line indicates what can be achieved with SKAO (50% of SKAl) which is 
quite similar to SKAl. Note that, in order to be as generic as possible, we did not include the foreground 
contamination in this analysis since the results will depend on the cleaning method adopted. The foreground 
residuals should degrade these constraints, specially on scales of the order of the frequency band. 

and an interferometric survey at higher redshift. Measurements of the equation of state are most 
critical at lower redshifts, z < 1.5, where dark energy begins to dominate the cosmic expansion. 
Bull et al. (2014b) show that a 10,000 hour and 25,000 deg^ autocorrelation survey on either SKAl- 
MID or SUR will be capable of producing high-precision constraints on w, bettering all existing 
surveys due to its large survey area (see Fig. 6). While the resulting dark energy ‘figure of merit’ is 
a factor of ~ 3 worse than forecasts for a future Euclid galaxy redshift survey when combined with 
Planck CMB data and BOSS low-redshift BAO measurements (since Euclid cannot probe redshifts 
below 0.7), a phase 1 IM survey will nevertheless be of great utility in superseding other low-z 
measurements in the joint analyses that will produce the best constraints on w. 

Another important quantity that can be derived from BAO measurements is the spatial curva¬ 
ture, Q.K, which describes the global geometry of the observable Universe. A key prediction of the 
prevailing inflationary theory of the early Universe is that the spatial curvature should be extremely 
small. Current constraints (e.g. Planck Collaboration 2014) find | < 10^^, but a precision mea¬ 
surement at the ~ few x 10^^ level is needed to really put pressure on inflationary models (e.g. 
Kleban & Schillo 2012). In combination with Planck CMB data, an SKA IM survey would be able 
to approach this value, measuring 

< 10-3 (7.1) 

with 68% confidence (Bull et al. 2014a). 
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Figure 6: Left: Predicted constraints from SKA on dynamical dark energy parameters. We show predicted 
constraints from SKAl IM and SKA2 galaxy, compared with predictions for Euclid. Right: Predicted 
constraints from SKA on the unparameterized growth function /dg from the SKAl (galaxy and IM) and 
the SKA2 galaxy survey, compared with predicted constraints coming from the Euclid galaxy survey. Both 
constraints include Planck+BOSS priors. 

7.2 Growth of structure 

Viewed in redshift space, the matter distribution is anisotropic due to the distorting effect of 
peculiar velocities in the line of sight direction. Coherent peculiar velocities on large scales encode 
information about the history of the growth of structure in the Universe through their dependence 
on the linear growth rate, f{z), which can be measured from the degree of anisotropy of the redshift- 
space correlation function. The growth rate is directly related to the strength of gravity, and so is an 
extremely useful tool for probing possible deviations from general relativity that have been invoked 
as an alternative to dark energy to explain cosmic acceleration. 

Intensity mapping and galaxy surveys do not measure the linear growth rate directly, but are 
instead sensitive to simple combinations of f{z), the bias b{z), and the overall normalisation of 
the power spectrum 03 ( 2 ). A reasonable choice of parametrisation is to take the combinations 
(/agj^CJg). As shown in Raccanelli et al. (2014), a 10,000 hour and 25,000 deg^ SKA phase 1 
intensity mapping autocorrelation survey will be capable of measuring /ag with high precision 
over a wide redshift range, obtaining sub-1% constraints in the range 0.05 ^ z < 1-0 with Band 2 
of SKAl-MID or SUR, and reaching out to z ss 2.0 with ~ 4% precision using Band 1 of MID/SUR 
(see Fig. 6 ). 

At low redshifts, these figures are highly complementary to (e.g.) a Euclid galaxy redshift 
survey, which should obtain ~ 0.5% measurements of fOg in the interval 0.7 < z < 2.0. By com¬ 
parison, SKAl-MID/SUR will have ~ 0.5% measurements for z ~ 0.3 - 0.7. 

7.3 Probing ultra-large scales 

As briefly mentioned above, there is important information that can be extracted from the 
ultra-large scale modes of order and above the cosmological horizon (see Fig. 5, right panel). We 
refer the reader to Camera et al. (2014) and references therein for an extensive description of the 
ultra-large scale effects briefly mentioned here, as well as to the ways by which the SKA will be 
able to tackle successfully the technical problems arising when trying to access those scales. 
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Figure 7: Left: Power spectrum of dark matter (solid) and HI (dashed) at z = 0.4 (blue, top) and z = 2.5 
(red, bottom), with /nl = 10. Right: Forecast Iff error on /nl (top); HI Gaussian bias (middle); effective 
IM survey volume (bottom). From Camera et al. (2013). 


One of the most important features on horizon scales is primordial non-Gaussianity. Many 
models of inflation predict a small amount of non-Gaussianity in the statistical distribution of pri¬ 
mordial fluctuations. This produces a signal in the bispectrum, but also in the power spectrum 
- since primordial non-Gaussianity induces a scale-dependent correction to the Gaussian bias: 
b — b-\-M>. This correction grows on large scales as oc for primordial non-Gaussianity 

of the local type, where /nl is the non-Gaussian parameter. 

In Camera et al. (2013), an analysis is given of the constraining power of IM surveys over 
non-Gaussianity; their results are summarised in Fig. 7. This shows that the forecast errors on 
/nl can be taken down towards < 3 for a deep enough survey with sufficient dishes. We 
recast their analysis according to the updated specifics of Table 2, and adopt a SKAl-MID IM 
survey operating for 10,000 hours at a system temperature of 20 K. The chosen bandwidth is 
therefore 350 — 1050, where we keep the last, high-frequency bins between 1000 and 1050 MHz 
for foreground removal. The bandwidth is further subdivided into constant frequency bins of 10 
MHz width, collected into ‘chunks’ of 20 by 20 bins in order to construct a 65 by 65 tomographic 
matrix. (To deal with the large number of bins, we use a block diagonal tomographic matrix where 
we correct for the overlapping, as described in Camera et al. 2013.) Such a configuration eventually 
yields a constraint on the primordial non-Gaussianity parameter 

<^/nl = 2-3, (7.2) 

namely more than three times better than the current constraint from Planck (using the large-scale 
structure convention). 

SKA IM surveys will also allow us to test Einstein’s theory of general relativity for the first 
time on horizon scales. One of the most interesting effects predicted by general relativity is the cor¬ 
rection to the standard Newtonian approximation for the observed galaxy overdensity. The Kaiser 
redshift-space distortion term is a relativistic correction that is significant on small scales. Further 
relativistic corrections include other redshift terms (Doppler and gravitational), Sachs-Wolfe (SW) 
type terms, and integrated contributions - from weak leasing magnification, time-delay and ISW 
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Figure 8: Left. Displacement field power spectrum for Zs = 2 (solid black line) and Zs = 3 (solid magenta 
line) and the corresponding measurement errors using the SKA2 specifications and assuming no redshift 
evolution in the HI density. Right. Displacement field power spectrum for Zs = 2 and the corresponding 
measurement errors using the SKAl specifications and an increasing £2 hi( 2) with redshift. 

terms (see Camera et al. (2014)): 

St = {bm + Ab)5-^-^^n%{n-Y) + 5GR, (7.3) 

The terms in 5 gr grow on ultra-large scales and need to be accounted for. 

It turns out that for IM, some of these corrections are strongly suppressed; e.g. the weak tens¬ 
ing magnification and the integrated time-delay terms are zero for IM (Hall et al. 2013; Duniya et al. 
2013). However, the Doppler terms and the ISW contribution remain and need to be incorporated. 

At the same time as testing general relativity, one can also test specific modified gravity the¬ 
ories which deviate from general relativity on large scales. SKA IM surveys should enhance con¬ 
siderably the current constraints on parameters that describe modified gravity solutions; this is an 
area for future work. 

7.4 Weak Lensing 

HI intensity mapping can also be used to measure weak gravitational lensing. Gravitational 
magnification will have an effect on the clustering properties of galaxies that is coherent over a large 
range in redshift. The effect can be detected by applying a quadratic estimator to the brightness 
temperature maps. HI intensity mapping with SKA would allow for weak lensing measurements at 
higher redshifts than are possible with more traditional weak lensing methods based on the shearing 
of galaxy images in the visible. The technique is more fully described in the Weak Lensing chapter 
of this proceeding and in Pourtsidou & Metcalf (2014a,b). 

The signal-to-noise of such a measurement is strongly dependent on the HI density at z ~ 2 — 3 
which is not yet strongly constrained by observations (see figure 1). For a conservative assumption 
of no evolution in Dhi(z), we find that SKA-MID phase 2 should be able to measure the shape of 
lensing power spectrum and its evolution between z = 2 and 3 (see Fig. 8, left panel). This assumes 
a 20,000 sq.deg. survey. 

If Dhi(z) increases by a factor of ~ 5 by redshift 3, as seen in the DLA observations from 
Peroux et al. (2003), significantly higher signal-to-noise can be achieved and SKA phase 1 should 
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be capable of measuring the lensing power spectrum and its evolution with high accuracy, as shown 
in Fig. 8, right panel. This would be a unique probe of the expansion history of the universe and 
gravity at this range in redshift. 

8. Technical challenges 

The idea of using intensity mapping to reconstruct the large scale structure of the universe 
bring radio astronomy back to what has been one of its greatest successes - mapping out cosmo¬ 
logical diffuse emission. Indeed, tremendous progress has been made in mapping out the cosmic 
microwave background (CMB) and the hope is that many of the techniques developed there may 
inform us on how best to proceed. We will now briefly address some of the problems that need to 
be tackled if we are to move forward with this technique. 

For a start it is clear that different redshift ranges will require different observation "modes": at 
high redshift it is preferable to use interferometers while at low redshifts it should be more efficient 
to work with single dishes (or "auto-correlation" mode in the parlance of radio astronomers). This 
is not a watertight rule. For example, with the clever use of phase arrays or cylindrical arrays, it 
should be possible to construct interferometers with short baselines and large fields of view hence 
accessing larger wavelengths at lower redshifts. But, for now, this separation of scales/redshift is 
useful in guiding us through the issues. 

At low redshifts one needs to perform a classic CMB-like observation which is to raster scan 
the sky building up a rich set of cross-linked scans that cover as much area as possible with as much 
depth as is necessary. The key problems are then dealing with the long term drifts in the noise (the 
1 // noise which is ubiquitous in such experiments) and accurately calibrating the overall signal. 
Usually, the first problem can be dealt with sufficiently fast scan speed (such that the bulk of the 
signal is concentrated in frequency in the regime where the 1// has died off and the noise is 
effectively uncorrelated) but this can be difficult to achieve with large dishes such as the ones that 
are envisaged in current and future experiments. For some setups, the fortuitous configuration of 
elevation and location mean that drift scanning may lead to a fast enough scan speed. 

With regards to calibrating single dish experiments, this is a source of major concern. Major 
systematic effects to be tackled are spillover and sidelobe pickup as well as gain drifts. Again, 
these are issues that have been tackled successfully in the analysis of CMB data although novel 
approaches can be envisaged. So, for example, the BINGO experiment (Battye et al. 2013) propose 
to use a partially illuminated aperture and a fixed single dish, minimising the problems that arise 
from moving parts. Another intriguing possibility is, for a cluster of single dishes working in 
autocorrelation mode, to use the cross correlation data for calibrating off known sources. This 
means that in principle, calibrating the gains should be straightforward using the interferometer 
data since the high resolution will allow access to a good sky model. 

In the case of interferometric measurements, the challenge is to capture as much of the long 
wavelength modes as possible. The largest wavelength is set by the smallest baseline which implies 
that arrays with large dishes will not adequately sample BAG scales at low redshifts in interferom¬ 
eter mode. To mitigate this problem, one can work with dense aperture arrays which can be a 
possible design for SKA2 (or just use smaller dishes and pack them closer together). This results 
in smaller baselines and a larger field of view for the interferometer, but reduces its total effective 
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collecting area (and thus its sensitivity) if we want to maintain the number of correlations low. 
Alternatively, reflector designs have been proposed - for example, long cylindrical reflectors with 
many closely-spaced receivers installed along the cylinder (Shaw et al. 2014). This provides a large 
number of short baselines, and a primary beam that is ~180° in one direction but much narrower 
along the orthogonal direction. 


9. Conclusions 

Neutral hydrogen (HI) intensity mapping is set to become a leading cosmology probe during 
this decade. Intensity mapping at radio frequencies has a number of advantages over other large- 
scale structure surveys methodologies. Since we only care about the large-scale characteristics 
of the HI emission, there is no need to resolve and catalogue individual objects, which makes it 
much faster to survey large volumes. This also changes the characteristics of the data analysis 
problem: rather than looking at discrete objects, one is dealing with a continuous field, which 
opens up the possibility of using alternative analysis methods similar to those applied (extremely 
successfully) to the CMB. Thanks to the narrow channel bandwidths of modern radio receivers, 
one automatically measures redshifts with high precision too, bypassing one of the most difficult 
aspects of performing a galaxy redshift survey. 

These advantages, combined with the rapid development of suitable instruments over the com¬ 
ing decade, should turn HI intensity mapping into a highly competitive cosmological probe. One of 
the key instruments that can be used for this purpose is phase I of the SKA. A large sky survey with 
this telescope should be able to provide stringent constraints on the nature of dark energy, modified 
gravity models and the curvature of the Universe. Moreover, it will open up the possibility to probe 
Baryon Acoustic Oscillations at high redshifts as well as ultra-large scales, beyond the horizon size, 
which can be used to constrain effects such as primordial non-Gaussianity or potential deviations 
from large-scale homogeneity and isotropy. 

Several challenges will have to be overcome, however, if we want to use this signal for cos¬ 
mological purposes. In particular, cleaning of the huge foreground contamination, removal of any 
systematic effects and calibration of the system. Foreground cleaning methods have already been 
tested with relative success taking advantage of the foreground smoothness across frequency but 
novel methods need to be explored in order to deal with more complex foregrounds. Other con¬ 
taminants, such as some instrumental noise bias that shows up in the auto-correlation signal, can 
in principle be dealt with the same methods. Ultimately, we should deal with the cleaning of the 
signal and the map making at the same time. This will require even more sophisticated statistical 
analysis methods and it will be crucial to take on such an enterprise in the next few years in order 
to take full advantage of this novel observational window for cosmology. 
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